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Abstract 

The present invention relates to a polyester synthase gene coding for a polypeptide containing the 
amino acid sequence of SEQ ID NO:2 or a sequence where in said amino acid sequence, one or more 
amino acids are deleted, replaced or added, said polypeptide bringing about polyester synthase activity; 
a gene expression cassette comprising the polyester synthase gene and either of open reading frames 
located upstream and downstream of said gene; a recombinant vector comprising the gene expression 
cassette; a transformant transformed with the recombinant vector; and a process for producing polyester 
by culturing the transformant in a medium and recovering polyester from the resulting culture. 
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aBft«L<«f=f»OShfcE?>J«:^. #yx*?-;u 

CM*! 2 ] r«3h*ttSEMe£tr# 
"J x A TJlM'&Wmj&izrF-. 

^«*-^>y-fw>y7u--Aj&s % se?ij#-^4r 
5 ] # y xx^;ug^S8S»£^©±ifciCS 

«3h**aiawtdOfeflDr**i»«3i3iattojS£E 

TS^-tevh. 20 

cw*a7 3 #y x^f-;um^e^iteTcDTifeccs 
cis 1 0 ] n^i 9 sm<Dfcnw&»*i*mcm 

£T£*yxxf;u tMGBfi. 
(HM&I1 1] *tfx^fW :>c5fil : 40 
Htl] 

R 
I 

HO-CH-CH2-COOH 

( R«**KTXtt£OHR 1 - 4 ©T-fc*A'S** 

CW3RJ11 2 ] *'jx^f;^ #y (3-fcKo* 

A±fcfi^»r*5is^i oiea©*yxx^u<aRii so 




0- 1 0 8 6 8 2 

2 

[0 0 0 1 ] 

y x x ^UflMJJfc&ffittiBB'r * . 
(0 0 0 2 ] 

Ct£*<?3:»j] fi» < <D»£«Jtt. #y-3-fcPp + 
J/y^U-h ( PC3HB). ) *r£*J«U x*;L>=F-©flf 

-S.'SSflttt^^^ffltBO/c PC3HB) "fit, ISO'CllJgCC 

> M 77x^7 Libras 5 nri>s. *fc. poh 

£. L*>0tt*S6. P(3HB)«, ^fealrtt^^CO/c^CC 

(0 0 0 3]fi^ 3 - t Kn + i/^u- h (3HB) <b 
3- t Ka*^**^x-K3W) ±<D 2 
#»Jx^f;b PC3HB-co-3HH) JsJ:tf*©SHSffiK:-x> 

T. W35. BMfatffcSfU fc<bx«> 5-93049* 

<&f8fe<fc^^^7-26so65^#Rcc*n^n?eag$nr 

Cft6<&2tf8<Z> P(3HB-co-3HH) £S#f*©SBfi 
ffitt. ±±9#»Ufc7xa**X • fcTX (Aeromo 
nas caviae) *fflC»T5fl/-ir>H"f»*y-7 r *-/^6 

^s^i-rsior&s. «Btsufc p(3hb-co-3hh) 
3hhjlx v b fi-motm ttt> aa&ikitg. 

ec4Jnxr*sc<!:3ftiW6*K:sti'Ci»* (y. ddi, s. 

Kitamura, H. Abe, MacromolecuTes 28, 4822-4823 (1 
995)) o U*>l/«tf*6. »W¥5-9304**&«:fe<fcC«# 

7UXtX^ (K«8S^»*rt<0#yx^fr;l-a«) *HBC> 
?c#>, PC3HB-co-3HH) **'&^yx 4 XT-;U«:jftjR*r4 

(0 0 0 4] 

$1^^^ ^ * - <C cfc o r?BK«fft S n/c^®ia^SO' 
B3BWK»i*tflit*te#y xxf-;KO«HI*ffi«rli«T- 

CO 0 0 5 ] 

fzmttic fm t a ±i3tacrF«o * - ^ > y - ? << > 



(3) 
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[000 6] t^b^, SB^J*-^2ra3tl 

3 - ft s * y x^f-;us^s»a{5^r* a. sag 
t^ww^ns, io 

[0 0 0 7] S6«C, ftffe* y xXr-JUM^ 

y - 7 s a > ^7 u- Ao^rti^— 3r£ striae 
tiie^js-^ 3 ) tmmti* # y x* 7*;nt#s?5sa£ 

&ls<KttmZtitdBfrl&'£fr* xy^/U-CoAbF 
(0 0 0 8] CCt, *««IOiPi;x^f-^»*B3Ril 

smmMcte^x 1 fs^o < jjsffloT 5 -/scc*^ 

w;x^f ;ua-£r£te£ 5 R 

[ 0 0 0 9] $ #ft9Jtt. Hutd* »; x^r-juM^- 

«c J: o rjgmstts ftfcJBWSftfcr . 
CO 0 1 0 ] $^C, mSBME&fteeill 

C 0 0 1 1 ] 
(ft 2] 

R 

I 

H0-CH-CH 2 ~COOH 

COO 12] (RIJMI^XiSKSSl-407;^ 
;u**«-r, ) r^Stia 3 - b Ka+i/r^*>K<D 
^M-£<* («*tf, tfy ( 3 - b Fa+i/^u- h - 
3-b Fa^^tf yx- r ) 7>^'A^M^») * 50 
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Co 0 1 3 ] 

CD ^'Jxxf ^M^P^it^^co ^ a - x > ^' 

^f-;u«^aw«e-f *wrsB*^s«re#D na* 

fEKT*. B»iLm «x!tf7xa^-^ttf 
x (Aeromonas caviae) aWJttf fctl«5. 
(0014] tt&ffDNAOBWa&«DO^re<:A^S 
7xn^-=lrftx^LB 

X"i?Affi (Currnt Protocols in fctolecular Biology,l 
^, 2.4.3 H, John Wiley .SSons tiJ*S , 1994^) ^CC 
*9»&ttDNA*M»rS. 

[0015] ±3BO#ffi«CJ:.Of#e>nybDNA€:3a^^ 
<0!*.tfSau3AL BamHI, Bqlll^) "Cgfr##fiS 

u&a. 7w;t^7 7 *— tf»a*fft\ dnak 

>Hftr*. Cft*#IIMHR (0l*.t*BainHL B 

gin go twuc^^-i7'{y->3>«f 
(0 0 1 6 ] «3E8Sfife«f-CS»W«:i»SI 

^.^-iim 'ffl*.«Ef-BL3 . M13 > A gtu3?#* 
Cf^tl. 77XU*<^^-iim #l;LtfpBR322* 
PUC18 , pBluescript II ( STRATAOENEtti^) ^W^> 

[0017] DNA(K>fri^^*--K>1-t*aaSSS#* 
ktt, fittODNAyrf-lftiW. DNA 

[0018] S^£»fc^A^**-*«A"r*«: 

JS^^T^JiSiOtS^U^^^^Affi (Lederberq.E.M. 
etal. J.8acteriol.l23,1072CL974)) ^XU^ ha^T 
U — ^3>S (Current Protocols in Molecular Biolo 

pv, i*. 1.3.4 1, is94^) zmmrzcttfiv*, 

Wttt&M&y r-^DNA<Dii^{iW>b ha • /S y 
tr — is^tf& (Current Protocols in ffolecular 8iolo 
qv, 1«. 5.7.1 M, 1994^) «»fflT4Ci#rf 

(Cigapack II ; STRATAGENE ?±*J^) ^l^oC i ^> 

[00 19]^ 7xo^t7'^ttx(/)^'jx^ 
^M^P^il^^^DNAtBrK-^f^^/c^cD^n 
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CCOlvCte, fflcWmM&Ob<b&%Q*>tlXi<>2> (Peop 
les, O.P. and Sinskey, A. 3. , J.Biol .Chem. , 264, 15 
293 (1989): Huisman, G.W. et al . , 3 .Biol .Chem. , 26 
6, 2191 (1991) ; Pieper,U. et al . , FEMS Microbiol. I 
ett., 96, 73(1992)ffe) . ^Ct, C.tl6©T5-/»ig 

jwd^s, «??sn-ci^2o<D««*aiRo. -en* 
=>- Kr*«tt«»iB9J*«eor*';=f^u*^ f 

tf 5 '^C(C/QCC(C/G)TCXlATCMa/QAACrCT/A)(T/QTACT 
/QATC-3 ' (HW»#7) % R^5'-(G/C)ACCCA(G/QGC 10 
(C/C)GrCCA(A/G)TC(G/C)GGCO\CCA-3' (E?J#-^8) "C 

asn£2@£io*'j fW6tlS)^C 
n^teffiSSnSfeO-Cttfti*. 
(0 0 2 0] Ch6M'i^>7Ut^F^7 7>f7- 

0T*'V ^ ^-^ilUSfc CPCR; Molecular Clonin 
q, 2m. 14.2M, 1989^) £?t5. fOt, PCR<C<fc 

c o o 2 i ] jjhc, cos*fl»*nRK-*3asttttii*ffl 

^ra»u. mflc^fe^D NA7^7 y-jfcfcjp- 20 

-A-f^'J if-f •fe'-i'aVStf 9 (Curmt Protocols i 
n Molecular Biology ,1*. 6.0.3 H, 1994*£) . 
[0 0 2 2 ] 3Di^^'J #4*£—\ a 
^7 »; -z>^Shfc^lS^?»7;^ (Currnt Pro 
tocols in Ntolecular Biol ogy , l3§ ,1.6. lH , 1994^) 

dna WrK*#f#6 n a. 

[0 02 3] ±BD N AKJtO*«^JOtfeEW, ££fl 
2f&. Pdx.{^>^-?i (Molecular Cloning, 2^, 13. 

3M, 1989^) ^c<fc^rtf ^ci^rt, ttssjus 30 

W*tf373A- Cm ->-^X>tr- (Applie 
d Biosystemsfct) ^ifH^Xff? C tftVt &e 
[0 02 4] 1 CC**9J©# 'Ji^fJU^i 

s # ^ ^ f *i # x ^ f - ;u « - &®tt * t re 6 r 
[oo25] ±iE^3fe»<oscatt. fittoewas? 

*SSSSIS^S (Current Protocols in Molecular Bio 
logyl#, 8.1.1 I, 199430 f <Cjz*)W&kT2>C£&-C 

^CCj:or, Xra*fettDNA*SIS£0/cPCRffi 
ftC J:.-* r . £>£i, > (^SSS^I rWT SDNA Kit*: 7' 
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[0 02 6] (2) »KIEftflKDfl*{ 

access nr. ; pu.ti, r/i>* yy**jgKjir£S& 
F^x/gccjsr*®^, '^rt^JSK 

Co o 2 7 i r;u*yy*^JK«cjsr.s{SSfe^ ^>^-- 

^RK2aSSfi** : ffr5pLA2917CATCC 37355) , 
«RSF10lOlSSf4a^WTSpJRD215(ATCC 37533) »W 

[0 0 2 8] S3E*"C»3JI-C* 

^ iot^n^uf n^i^t ^ J:^, ft&tx. trp -?* 

M-^-, lac yo-t-* — , P, — , P 

R yC2*-^-, T77'C3^r-^-^i'CD^J^a^7 7'- 

^ccfi^fers^o^-^-^ffli^ns. anemia 

-<;?J">^;#H (Current Protocols in Molecular 
Biology, 1*; l.S.lH. 1994^) , XU^hD#U- 

>*3>££ (Current Protocols in Molecular Biology, 1 

«. 1.8.4 R, 1994^) •jWtfBfti. 
( 0 0 2 9 ] P«*lBiiCrffl^4»^tt, -|%5i-<^ 

5?-iUt, W;U*YEpl3 , YCp5Ca?*SW6n^ 0 7* 

D^-^-am Wxtfqal 17'0^^-, gal 

S (Methods. Enzyraol . ,194,182-187(1990)), * 7 x P 
7'^^ h& ( Proc.Natl .Acad.Sci .USA.S4 ,1929-1933(19 
78) ), ? ; * V Affi ( 3 . Bacterid , ,153 ,163-168(198 

3)) IfW^ni). 

[0 0 3 0] lb«lttlia«:S£^ Itfflt^^t 

^-CWittfpcONAl. PCDNAI/Amp ( V > f h D 
^*>tt) SF^fflC^nS. Hl»MISx.ftDN 
A©iSA^ffiiL/r«, ^4^«, xu* hatfu-i>3 

(0031 ] ccr, ifriEcoi^tcorsyesn/ttts 

I2^JU, ^lix^f-^S^gSiHS^O^iC. ^-<7)± 

^ x^ r^m^-P^ite^s (O^a*- 
* - M«03SeTCc4> 4< it 2<1<00R F <b <b i>tcjr 



(5) 

7 

.[ 0 0 3 2] * -;x^^;ni^^iteTo±t?£^{4g 

TSO R F ^tTF rORFlj <hl^\ T«McffiH-rs 

ORF*«T TORF 3 J <hl>5. ORFltt, 

# y x x r ;l/<DS«<c £ it&t-Xte #VxWi/ 

#y xxr;^^ic^T^x^^ju-C oAth"5 
*~ fe'(1#(t(R) -HIKx/^b-C oAtF^ 

( 0 0 3 3] h i ic^-r J: 5 (c. #&J9Mfflj io 

em (mi cu «c*j^t r-35/-ioj twft) , ^'jx 

*f-/bfi$BS5jie^ ORF1MORF34SOEC 
oRIK#**p (0 1(1) ) , C<7>tftK-£ 
EE32<hT£o 

[00 3 4]3CfiC. EE32(C:feU>TORF lXliORF 

(0 0 3 5] »K«iai«J«iOFRleOHR 20 

Htt««<bOBB. RCXOFRl©fflRff±«*i#»;i 

fMKlBRBgUI aJfi^AL. Bglll KJ: 0 ORF 1 £ 

ttc-ttz (Hi (2) ), chiiaaccbr. #"»;x* 

r *S«lMU^QmUt±flM 4 O R F 3 t <DVA K 
*JEB3RBan«ia««:*A 0 , BamHI&SK: J: 0 OR F 3 
(HI (3) ) . 

[0 0 3 6 ] ORF iac;ORF3C0Sf^§tS 
lc«. EE32«:o^T\ IfdORF l£tfORF 3£X 
ffeStf5fSM**W#tTAtfJ:c> (SI (4) ) a M 30 

^#J§£M& (Currnt Protocol? in Molecular Biology, 
1#, 8.1.11, 1994*).KJ:^Ti»A«Cfc**r* 

(0 0 3 7 ] c © j: 5 cc o rf# 6 -ti/c*n-eno ae-^ 

RD215 (ATCC 37533)) (C»AL, 1S 6 £ 
-SrJBl>T, TJU^7 'JW^ • a- hay 7 A (Alcali 
genes eut rophus) • PHB^4 ft ( DSM541) (*'JxXf;l/ 

[0 0 3 8 ] (3) # »; x* t-;kd§^ 
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[0 0 3 9] r;u*yy*^K«cjR-rs«^ftXtt^* 

- k * * mtcm? s €rmi 4 u r » e> 

i/»4«ais«*^x.. ass. te«fass^^*as 

- o . 1% cc sups l tcmwm-f 6 ft * o 

[0 04 0] «JR»«a^«(0it WCiSWr h 0 . ^ 

[0 04 1] ^^®4 Ut;«, WxtfTV^-T, a<t 
7>^X^A, M7>^x^A v 'J >g7>^x^A 
^07>^rX.O AffiOfife, ^x=^X, iSx 

turn m^ r ;>^-*';^A, »;>mm~^'; 

•?A. y>»-7^*J/^A, K«7^i/^A, feft^ 

(0 04 2] cb'CDjf^^ 
T. 25-3rcr»KB«a24«FnBfeLb <«m«i^7 

(0 04 3] BaH40^c*-^-S:fflt^c#^^^^ 
fJU-/3 - D-^^-*'7^ h t*7 -/i/ K (IFTC) , ^ > 

f-;ut^ «;;uk-cim) Jfitcssarrsc ttifi-Ci 

[0 044] tJj^ffl^^Ttii UX»6n/c?BKiQfe«: 

6ft ii«5%CO,ff?5ET, 30-37'Cri4 
— 28BfBHf5. JS«*tt*^v-ft/>, -<xvtj 

[0 0 4 5 ] *«W«l*Jl^'C. ^'JxXf^O^SWl 
xBHTOJ: ^ 5Ctf 0 C i ^T' # ief«SE^6«-C^ 



(6) 

9 

raw* yxx?\rt, *»•«.- 

CO 0 4 6 ] ?#6n/c^yxxf-;^aWOfcO"C5>-5 
• tfx^5*jKL/c^ i ;x^7 : -;uS-&K^ 

XI : 
(0 047] 
Cft3] - 

R 
t 

HO-CH-CH2-COOH 

(0 04 8] (Rtt;KJOT^XW««»l^-4©7/U* 

hiU/cM^(* (#«;xxf;i/) 

tf#V (3 - t Fn^S^^U- h- 3 - t Fn+'>^ 
*tr,/x-r) ^>^AftS^f* ( P(3H8-co-3HH) ) Q 

teP(3HB-co-3HH) Z&&X7^^V^m.?2>n&3$:^ 

■r. 

(0 04 9] S£*-C«, sFV-3-fc Fn*S/^U- 
h ( PC3HB) ) *4tMS#y 

h-3-t Fn*i/^yi/-F) ( P 

c3H8-co-3HV)) osiJSfficcocirwss, Hfttfttsn* 

fc#. cn60^yx^f*;U«lSMSttlS^©/cto(c 30 

www***** ti**> «tt±co mh^ s> * • 

(0 0 5 0] ^S6CD3 - 1 KP^ + 1f/x- h 

fflCift:S£*<DPC3HB-CO-3HH)SBSft (1#P3¥5-93049^ 
[0 0 5 1] Ch^O. *»?Br«P(3H&-co-3HH) ft 

s^y x^^uiatiDw-c^Braci^r*"*. ± 40 

*^ajCDS{5 : f : - ; &S^TP(3HB-co-3HH) ftf^'Jxx 

war ^c^t^, 

(0 0 5 2] 

vm?z>. fiu chb tfCx:<D mm 

CXtkMl 1 7XC^^ • ^tt'X^'JXXfJH 50 



«H¥ 10-108682 
10 

^S^iteT-O £a-x>y 

(0 0 5 3] TXD^et^ - ^tfxFA440Ml0 
Cm! <DLB*£itfi ( 1 hx^, 0.5 %h'j7> 

>. 0.5 %Mthy)A, 0.1%^D-^, pH7. 

5 ) 3o # cr*ss^*ofca, MAt^vrty^h y 

^^UTV^x^Agi (Currnt Protocols in Molecula 
r Biology, 1« , 2.4.3.M, 1994^; John Wiley & Sons 

HUE) K£9$fefe#DNA£f#/c,, 

(0 0 5 4] {f^nfc^DN A*fWRB3RSau3AI-C 

X * K"<^^-'C^>^pLA2917(ATCC37355)€r^fflL^ c 
CCD^7X; F«MRlHRBglII tMU, Jfty>Sft 
(Molecular Cloning, 1#, 5.7.2 H. 1989*£; Col 
d Spring Harbar Laboratory fcHJK) £S6bft:gL DN 
Ay*'- **rfliur*6ft D N A aKJ^JBKK- <h 2& S 

(0 0 5 5] C<DilttDNAKrtS:ffl^fc-f>fcrha ■ 
/\"?>r — i^V^ffi (Currnt Protocols in Molecular Bi 
oloqy.l*. 5.7.2 M, 1994^) t,Cj:-?XjzM&SX7-l 
ftZmmW&U. 7XD^n • +1- trx^fe^DNA 

[0 0 5 6] JfcCC. 7xa^tX • * * trxcD^ 'ixx 
fvt/S^SStitfi^Sr^tf D N A K>i-%f»S /ca&O^ a 

-^wssufc. cti*rcc*n6nrir^ftao*yx 
^rM^©7 5 -/MEfljr J: < «ff S4vC(r»4 2 

'-CCCCyG)CCCC/C)TCCATCMCT/C)AAC7r(J/A) CJ/Q 
TACT/QATC-3 ' (15?iJ#^7) . RO'5'-CC/C)ACCCA(G/ 
OGCCG/OCTICCACA/QTCCC/OGGCCACCA-B * (EyiJ§-£| 

8) rssn5 2«a©+y rf**u** f*-&jssu 
(0 0 5 7] cn^ot'i^^ut^F^^^?- 

ib, 7xa^ - **fc*x©3Sfe#DNA£§SSJ£ 

p c Rfficc^ # y ixf jn^i^iifii 1 ^ 35 

7rif*iU/c 0 PCR«, 94-C-C30#. 50'Cr 30#&a'72 
*CT'60#(DJgIc;^ 1 *Jl>£ L-CCft£30-$r-< ^;1^t 

o/c. co^WBKtfSrDic dna CHSt* ? h (-c-r y 

(0 0 5 8] fl^Wc^D-y^t^Txn^t^ • 
+ tt'x»6(j:DNA7^7 ij^e^ax^^T* 
y ^^-fe'-^a >^iCcfcr>r#y xxf;H^SSige 
«*tf^^3 F*WrS^IIB«r#*U/c. C<7)X 
»B^6r;W yffitcj^r^x ^F£@iRT*C i 

7c. CCO^>t^8qlII-EcoRI »f)t(COC^f 

ior m ? n -5 3 . 2 kbpfKtf<Di£SlEW t 4 ^ S *i /c . 
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(0059] $ e>ic, cot&m&mc^xftmmkm 
^fo/cg^ c©3.2kbp©*«ie^jo*«:i3:. iayijs 
i xm s iis*aie^J(i785bp)*str# y x* f^ux 

3- FSft**>^£JW*. # y 7";^&irojtH J ?- 
co o 6 o ] iaw#^9Xttior*sfts*SE 

W«rWr4KK"«c*Jt*r 1 ±iai785b P CDtt5iE?iJ(DTi* 10 
*C??£eT « 405bp ( O R F 3 ) &£rt£¥*SSSI 

«L MC^«:±i*«C»a-r4 354bp ©2£f (OR F 1 ) 
3iV#S3IBKB««*BlfiLfc. ORF 1 <&ttSE?J*ffi . 
WM3, ORF 1 WO 3- F3h47 3 ^BE>iJt 
IS?']#-^4CC, ORF3<Dtt»?^J*:EW#^5, OR 
F 3 «c J: 9 n - F S ft *T S -'KIEflteEW*^ 6 (c^ 
t. ccr, ORF3WJxxf;^M#T6 
i^^-CoAt h*7f-^3- KT£iS£^CDfc 
CDr^^o ^L/r, ORF 3^CJ:0^- F3ft*T5 ^ 
■=6*T4#y^^ F^x^^-CoAt F^#- 20 
HsTStt. ^iC (R) -^fc/^-CoAt 

(0 0 6 1] 7A*yW^ • ^- HP? 

sgfttt i r me s ft orfi. # y * 
^m^p^ite^, 0RF3ao*i£^<s«s«*s 30 

tPBQlil-EcoRI K#©BplII»ft*EcoRiy >*7-£/B^ 
rEcoRlBBfifci U 3.2kbp<DEcoRI-£coRI&r)t ( E E 3 2 

»fJt) cft«r7^#y^**«tc«TS^» 

tp-C&maJi&tey'^Z 5 FpJRD215(ATCC37533)K3$A 

u S6ti/ciM^7^5 Frr^ayy** • 

PHB-4 # ( D5M541) 'JXXf ^J&fi&XJS 

C0 0 6 2 ] sr, c©fis^7*7^n> 

ssftofc. ccDffi»x^s®ir^^';y^x • a- h 40 

P7tXPHB-4 *fc*LB*ttl.5nn 3O'CT?**30g* 
0, *ft-!fft<O»«5K0.Unl*ift'&O, 30'Cr4^r B 1ig 
SL/c, C<2l§#^?g£MBF§xiS& (0.9 % U > 

®— 7^h>;^A. o.i5%';>®?— o.o5%^fc 

7>^-0A, 0.S°/o7;^h-X, 1.5 %^X. 0.3m 
q/ml*tv^t/» K&ffiU 30 a Cr 5 BBU6«t/fc. 
(0 0 6 3] Sfft^ASI*^^^ $ F&Tfr* V tr 
■ JL- FU7 r XFHB-4 SlCSieriQ(:^t7>( 
^>Irt*^TC£*p6, MBF»35*«Lhr*t«0.fc 
3ai-tt7Wjm • -3. — ha^T^JBMESM* so 




«fH¥ 1 0- 1 0 8 6 8 2 
12 

T*&£. CCD*^ 1 flOnn x-*:|ijKO. TJU^y 
n^-a-FD7 7 XAC3 2ft («T, AC32S 

X^XllSWiW^^, FERM P- 15786iLTSFIteft 

(0 0 6 4] ZWC&f&t'J^ZZl'** F*/Blr>fc«S 
{£#^£iftlE^££ (Currnt Protocol sin MDlecular Biolo 
gv.iS, 8.1.1 I, 1994^) &C<£oTEE3 2WrK<P<D 
ORF ia^OlW«C*ft*ft«BRlBISBflin 8M4* 
iAU, Bqlll-BqIII & c <L tc<£ o rO 

RF liaGT-*s^3teL/c»r^fP»L, 77^$ Fp3RD 
215 (C»AUc„ C(DffiW77X; F££^T, ±i£ 

©s^esfficcjc^rr^^'jy^^ • a- hP7r^ 

PHB-4 «c*»«WftLAi. »e>ftfc*«68M*«r* «T 
AC 3 2 l*±i«Mc. - 

(0 0 6 5] RHWc, »tt«»fl«Sffi«cJ:-jt E E 3 
2Br)t*©OR F 3 2&¥-<0ira«:*ft-eft*iMBjRBa 
mHl3H££zgAU BamHI-BamHlftffJt^^^-^-SCiCC 
J:->rORF3Se^-^^bfc»rK-^S8L/. 77^ 
3 FpJRD215 CCffALfc. COffiax.77X5F^ 

±aos^easicj: ^xt^h • a- h 

D77XPHM »t««R»Ofc a f»6hfc*WEft* 
*. HTAC3 2 3»iP^. 
(0 0 6 6] n«CC. E E 3 2 »rJ+*<DO R F 1 
OfT^C^ft^ftMPSS^Bqlll SHft*. ORF 3ae 
^iofa^^ft^ftS'JKSSSBamHlgPii^AO, Bqlll 
-Bqlll K>tfe J:^BamHI-BainHI»r>t€:^3l** C chCC 
iotORFl itfi^fc £ O R F 3 Se^ftCC^ 
L/dKJi^tt^U 2 FpJRD215 «:#AO.fc. C 

<Dift^7 , 7XU , »r, ±x&©3^{SS&cc<£o 

XT toil ' ^- r 07 7 7% PHB^4 ft^rfflgft 

G/c »6ftfcfl?«Efti*«:. «TAC3 2 1 3^ii¥ 

(0 0 6 7 1 EE 3 28ffK-^^S<bU, PC R 

sec j: ->r* y x*f-ji**&MRae?*iMju »e» 

7? x**^ y xx ^;u^^5tfE^-oi§^l^fpfI^i 

(E3ftSte««iOratC»AUfc. PCRte. S'-ACTTCCC 
GCCrCGCCTGTCCCTOU-3 ' (M^l^TL) ^5<i: US -GCCATAT 
GCGCrCATCCGGCCrCCr-3 • CK^J#^12) ?:77^7-i(, 
t. 94°CT30#. 55*Cr3(*Sa f 72'C"C6(«*©SJ£€: 1 

(0 0 6 8] CODNAK^7'7^ 5 Fp3RD215 iC» 

AO, fi?>n/dffix77^^ F^ffl^r, ±Mog^- 

SlffifiCcfcotT^* y • a- hci7 7 x phb-4 

&~Bn&8kLtc. »6ftfc»««tt**^ OTA C 2 
9&<hP^«, 

(0 0 6 9] (XSW3) 7^'jy*x jx-ha? 
7;^ , jy^X-a-hU7 7 ^H168 f PHB-4 
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13 

ft. AC 3 2 ft, AC321fic, AC 3 2 3 ft, AC 3 
2 13ft, AC 2 91**. 95m1<DMB*§ifi* 
(0.9% , J>BZth i ;«)^ l 0.15% *7 
A s 0.05%£{t7>^~?A) CC lmKDl %*^£>^ 

'CT^«L/c. AC 3 2ft, AC 3 2 1ft, AC 32 3 
ft, AC3 2 1 3ttMAC2 9tt6C^ttt/7t7-f 
^>*0.2g/L©^r$WSttfc. 12B$H* 24B$fffl, 36 

mtRZfASHnmatAMMtx 1 mlcD 1 

&QS0.5q) , 72BSfHJ«*l/fc. 
[0 0 7 0] H 1 6 ft, j&O'A C 3 2 1 3 fittC-^r « 
±a©MB^»«Cl ^sf-'J-^ift, ^-Affi, =J-> 

30*C-C72^ra^*L./c a frte, AC32 13 

(0 0 7 1 ] H 1 6ft, AC 3 2ft, AC 3 2 1ft, A 
C 3 2 3ft. AC3 2.1 3fl?tcoo>r«±aOMBSift 
tc l mio l %"N7*£ >&i- h y ■? A£fljx-/cigt&ccflB 20 
0. SP7^^, 30°Cr^SU/c 0 tt*5, AC 3 2 
ft, A C 3 2 1 ft, AC323ft, RtfA C 3 2 13ft* 



(S) ^?1 0- 1 0 8 6 8 2 

14 

* te»^S»CC«. igflMt*^v^>4o.2g/L0?ffiK 

mczti^ti 1 1 %^y^>mi- h y ^ A*asjnL 
00 <^:/*>Bw-h y»5A(OJteasja*o.-5g) , 725$^ 

[0 0 7 2 ] »Cv»(l«cJ:-DrB{**@iRiy. 

/Co $££lfi{*10— 30mq$C 2 mKDEsM- > £ -/ -^SM 
(15:85) <b2m1<0^Da*^A«raSftILri»S0. 10 

h ^7^«SJSS8^SiCC-i4A, *r t'7 y ^7A«' 

GL-y--Tx>X?±il{NELfrRA B0N0-1 (#^Ag25m, il 
7 A^0.25mm, 7£JgJl0.4 Mm) S:JJH>fc. SJg^ff 

[0 0 7 3 ] 
C«l] 



«/§ffift i 






3HB 3 HH 


w 


H16 


3.00 


86 


100 0 


PH&-4 


0.80 


0 




AC32 


0.99 


33 


78 23 


AC321 


2.85 


92 


- 87 13 


AC323 


2.85 


92 


88 12 


AC3213 


3.64 


96 


85 15 


AC29 


3.20 


94 


92 8 



3HB : 3-tKc^>yfL/-K 3HH : 3-t Ka^^x-h 



C0 0 7 4] 



3k2 



* * C»2] 











3HB 3 HH 


Hi6 #y— 

A?* 


4.27 
3.57 
4.13 
4.06 


79 
81 
79 
82 




LOO 0 
LOO .0 
LOO 0 
LOO 0 


AC3213 


3.54 
3.60 
3.58 
2.22 


76 
77 
8L 
70 




96 4 

95 5 

96 4 
96 4 



3HB: 3-tKn^?fl/-K 3HH : 3-fc Kn *v"\3r-?V.r.— h 



(0 0 7 5 ] 



C»3] 
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H16 

AC32 

AC321 

AC323 

AC3213 



(9) 



313 <^7Z ^>Wkmmh l^'Jx^r;^ 
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3H3 3HV ~ 3KKp 



2.50 
0.77 
1.67 
1.27 
2.76 



60 
7 

55 
40 
67 



50 
30 
46 
4S 
44 



50 
67 
52 
45 
43 



3HB :3-kKo#^fb-K 3HV : 3-kKn#^'JU- F 
3HHp: 3-t Kn^ri/^y^yx.— h 



(0 0 7 6 ] ***>K£j^&£l,/ci£^ SIICC^ io 

H i 6ftrterf<y (3 -t Fn*->:/*b- h) *^etf 
^S-&BSR««flBtt6 03 HH (3-fcFo*^*t 

ji^ttafcSHfcr** ph B-4#rttsaMaa(cj:--3; 
txfi*o*y x^f-^m^Bjsae^t^ e e 3 2 

Sf^iAl/cAC 3 2»"C«3 HH (3-t Fn^V 20 

s^i* ( pc3hb-co-3hh) ) **mm&m&h?c*) 3mm 

[0 0 7 7] $6iC, AC32 1^ AC3 2 3S, A 
C321 3*frCtt3 H H^12-15^r^%<DP(3HB-co-3 
HH) *92->-96fi*%B8tU ORFlI&f, ORF3 

ae*\ ^^i^©fiM^?^^cit^';x^ 

C0 0 7 8 ] mXbtcrfi x*f-;Um^II3SiH£ 30 

■KD»R»B««*J cfc r A * * 

94SS%CDP(3HB-C0-3HH) rfiL, ft5fc<E>^7j: & 

JMflUMM* J:tn£H»ttltflJte4Ml u r i> * y x * 

[0 0 7 9] it)^';x^fiHK?©iS^AC3 2 1 3 

#L,/c<hC3. ^2iC7FTJ:-5{C3HH^4-5^;U% 
OPC3HB-CO-3HH) *76--8lSJt%SStyfc. ti%ffi(C^ 
fc^<S*nSffilSS^^r*S^U'-r>M?:j^SiSd: 40 
Urfe3 HH^4*iU%OP(3HB-co-3HH) *70fig% 
TB»0fc. IrttftT&£H 1 6«cttC©SMrFT#i; 
(3-t h'P^^fU- h) ***y^-(£#£r&/$ 

[ 0 0 8 0] <c*j. 7XD^t7-.^ t tfxFA440 
ft-C«\ ^<;U$^>K*aaR«ibr8MI«OP(3HB- 

co-3hh) *ssrsci^ii§§nri^ (#1^^7-26 

OT96SaS*<DP(3HB-co-3HH) # % $ fcffiftr^fiB'C* 
^H^tt^r^tSi L T 76- 3l®:g% (DP ( 3 H B - c 50 



0-3HH) tfSSSftSC ^fSSSO^ffi-i: 
(3HB-co-3HH) -&/Si*ttflK>r ffift>c;£ffiT 

[0 0 8 1] ^7^>^^iL;/c^ 512 
T^CCT/l^y^** • a->D7 7 ^»fttt'C*S 
HI 6|frCtt#y (3-t Fn^5>^^U-h-3-t 

KD+^-Cy U- b ) (P(3HB-CO-3*V)) £^ 

f&?Z. CtlttH16»^rS#yx^f-;ua^»3R«" 
^7CD3HHp (3-tKa^W^/x-F) 
'SrSSil^fci&'C^So PHB-4ft{C7xMf 
X • ^^^xS^CD^yx^^^m^^ite^^tf 
EE 3 2K^iAl/cAC3 2ftt«3HHp»$5 
•=&jl/%CD^y (3-tFo^fU-h-3-tFa 
*S/<rcy F - 3 - 1 Ka + W^/x- F) Htu 

^S^<£ ( P(3H8-CO-3HV-co-3HHp)) £$&tgffif*gg<& 

[0 0 8 2] § AC321ft, AC 3 23ft, A 

C 3 2 1 3 ft~Cte 3HHp»$2-8 *il/%OP(3HB-co 
-3HV— co-3HHp) £ 40~ 67m&%Wffi L s OR F 1 S£ 
■f. ORF3iH£T-. **t>te*OW:frS:$tffc$-i±-£C 
it^ 'J xxf^iR?^f t < «f 3h/c («3) o 

[0 0 8 3 ] ch6©8**%, rxp^tx ■ + + tr 

X*3fc<D^yx^^m^a^«^»4-7<03-t F 

x* fj^^rs c * itx^ 0 

[0 0 8 4 ] CH»«4) 0RF3O«t8raS 
EE3 2KK"^iUT, PCRffiWoT0RF3 
i§fi^£i#ipIU mmy'^XZ FPET-3a ( 

.5 ' ^CATATGOGCCCACAATCCCTCX^A^TAG- 3 ' (!B?lJ#-^ 
13) 4oc££>'5 ' -CTGGGATCCCCCCGTOCITAAXCAOGTC-3 ' 

(I3?'S^14) ^y^-Zv-ilt, 95"CT60#\ 68'C 
r*3(«J?toSj£* 1 t^-Y *^£LT2S-*'-/*;l4Tofc. f« 
6tl/c7 7^ * F^<^T;*JJIMBL21(CE3) ft ( 

T. NB3#iTS. 

( 0 0 8 5-] NB3*4lOOml ©LB*gt6r30"C, 4B$ 
fiflig*U H'vypfi^^^f^ht^^^F (IPT 
C) €r^#*iJ!g0.4 mMiJt-S>i:9 4C?§flDL/r^5a*g|iS 
U S6fC30-Cr2B#fHJ$»Lfc. Bftiia^BCCJ: 



(10) 
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«fH¥1 0 - 1 0 8 6 8 2 
is 

[0 0 8 6] 
[»4] 



£14 



prate* (®#G0 xy^-CoAtK7^ —KHpStt 



0 

1700 



[0087] x^^-CoAb F5*— tf«tttt*0 
F-^-CoA (S/^tt») ^StiU (zffl£o.25in 
M) t 2«S^©**te#5 8bfcff»t (263nm ) «jH 

Sr4C£«cJ:»>jfc8Mfc, — ORF3fieT-^1fA 
ha-;l7 7X 5 FPET-.3a*a»A 
L /c^^ttT' «ffift « * ^ fc < fcffl S ft ft o fc. 
(0 0 8 81 -E-Ct. 

fe. NB3«HDoJ«1£*>^*iE# 

^5 3LJ4)V- 



m) (c<fc-?r*>^^**ta.s*. x/^i-coAt 

io F**-*ffittB»*BiROfc. StSffllfrPFJ'VJMK 
K^- h y -5 a - # y r ^ y iur 5 Fyx>«»iMiWMffjfe' 
6. H2«cSW"J:5«cWRiMliW«:*— r&£C£#fc 

[0 0 8 9 ] 
[«5] 



1700 
5100 



CO 0 9 0 ] fl^n/cMx^^^-CoAt F5£- 

eccsWi^tcHBfin F>r&*Met swots -/ME 

Wtt, O R F 3 if £^CDJS*iBaj#> 6 «£ U /c7 ^ s 

S6 7^ y®SffijgCDit8c 



[0 0 9 1 ] 
C*6] 



N-3cSftTSy®^j: 



SAQSLEVGQKARLSKRPGAA <SB?!J#-$yl&) 
MSAQSLEVGOKARLSKRPGAA (ffi?ffi-§i6) 



[0 0 9 2 ] C©Ci*6, ORF3^xy^-Co 
At F5* — FUri»*C43WJW"C#fc. Me 

&. ORF3«C3-FStiSiy^;I/-CoAb F^* 

( 0 0 9 3] StSWSOiSJCSffiRCC ( S) - 3-fcFP 
^^'JJb-CoAft Fayt-f (^^ttSD 

(**»AK0.2 h/ml ) <fcBMfcSl~3*>T5 F 

7rx>v^ * U^-^ 1 F (NAD+) («&iffieo.SfiM 

) Z&fiQtZt. i/^-C o At F7*--fe'CD4$ 
Ktttf (S) -<*»J|«-C*hfcf, ^L/c (S) -3 
- t Fa*S/^^yiU-C o Ate^t Fn^-HrtDft: 



305£r{c#r>rNAD + »S7cS*arNADH3&s^JsEL* 340n 
m CC«»fl9ftWK*r£t;S. iSCcx y YA-CoAtf 
— (R) -fMtSffTC&fttf, NA-DHtt&S 

[0 0 9 4 ] mitcfjk-? <fc*>Ct. 0RF3CC3-F3tl 
£xy-T;U-C o A t F7^-f»fc*^t(i, 34 
Onm -©fflbfcBK-f bttx y^;l/-CoAfcF7^- -fe'JJftj* 

Jlla©*■^iei^irBc;r*-»fc^.T^TJE© (s) 

HfcW&rtt. NADHO^«c#5ft*BBCjb*Jl6 

40 ftfc„ 

[0 0 9 5] 
(*7] 



§37 1#%<P340mi ^£tf^gjfe^fb 

x.y ^/i^CoAb K7*— tfftHHh 0.045 
ORF Sfefcx^-OKoAfc — 0.047 

(Sj-W^&gXv'-ot^CoAt K^5— 0.146 



[ 0 0 9 6] C08»^ ffli?xy ^;U-C o At F o A t F 7 2--fer£n- FLtC^C tffiftfr-6tc 9 

(R) -»fiFSKt*5C4^6^i^-3. [ 0 0 9 7] 
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10 8 6 8 2 



* CO 0 9 8]. 

IB?JS^ : 1 
K?U<D:S£ : 1 7 8 5 " 

ISyiJOfiSa : qencmic DNA 
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co-3hh) j: < ^fiRolfifer * -SMirWfflr * 5. * 

: 

ATG AGC CAA CCA TCT TAT GGC CCG CTG TTC GAG CCC CTG GCC CAC TAG 48 
Met Ser Gin Pro Ser Tyr Gly Pro Leu Phe Glu AT a Leu Ala His Tyr 

1 5 10 15 

MT CAC AAG CTG CTG GCC ATG GCC AAG GCC CAG ACA GAG CCC ACC GCC 96 
Asn Asp Lys Leu Leu Ala Met Ala Lys Ala Gin Thr Glu Arq Thr Ala 

20 25 30 

CAC GCG CTG CTG CAG ACC AAT CTG GAC GAT CTG GGC CAC GTC CTG GAG 144 
Gin Ala Leu Leu Gin Thr Asn Leu Asp Asp Leu Gly Gin Val Leu Clu 

35 40 45 

CAC GGC AGC CAG CAA CCC TGG CAG CTG ATC CAG GCC CAG ATG AAC TGG 192 
Gin Gly Ser Gin Gin Pro Trp Gin Leu He Gin Ala Gin Met Asn Trp 

50 55 60 - 

TGG CAG GAT CAG CTC AAG CTG ATG CAG CAC AGC CTG GTC AAA AGC GCA 240 
Trp Gin Asp Gin Leu Lys Leu Met Gin His Thr Leu Leu Lys Ser Ala 
65 70 75 80 

GGC CAG CCG AGC GAG CCG GTG ATC ACC CCG CAG CGC AGC GAT CGC CCC 288 
ay Gin Pro Ser Glu Pro Val He Thr Pro Glu Arq Ser Asp Arq Arq 

85 90 95 

TTC AAG GCC GAG GCC TGG AGC GAA CAA CCC ATC TAT GAC TAC CTC AAG 336 
Phe Lys Ala Glu Ala Trp Ser Glu Gin Pro lie Tyr Asp Tyr Leu Lys 

100 10S 110 

CAG TCC TAC CTG CTC ACC GCC AGG CAC CTG CTG GCC TCG CTG GAT GCC 384 
Gin Ser Tyr Leu Leu Thr Ala Arq His Leu Leu Ala Ser Val Asp Ala 

115 120 125 

CTG GAG GGC GTC CCC CAG AAG AGC CGG GAG CGG CTC CGT TTC TTC ACC 432 
Leu Glu Gly val Pro Gin Lys Ser Arq Glu Arq Leu Arq Phe Phe Thr 

130 135 140 

CGC CAG TAC CTC AAC GCC ATG GCC CCC ACC AAC TTC CTG CCC ACC AAC 480 
Arq Gin Tyr Val Asn Ala Met Ala Pro Ser Asn Phe Leu Ala Thr Asn 
145 150 155 160 

CCC GAG CTG CTC AAG CTG ACC CTG GAG TCC GAC GGC CAG AAC CTG GTG 528 
Pro Glu Leu Leu Lys Leu Thr Leu Glu Ser Asp Gly Gin Asn Leu Val 

165 170 175 

CGC GGA CTG GCC CTC TTG CCC GAC GAT CTG GAG CGC AGC GCC GAT CAG 576 
Arq Gly Leu Ala Leu Leu Ala Glu Asp Leu Glu Arq Ser Ala Asp Gin 

ISO 185 190 

CTC AAC ATC CGC CTC ACC GAC CAA TCC CCC TTC GAG CTC GCG CGC GAT 624 
Leu Asn He Arq Leu Thr Asp Glu Ser Ala Fhe Glu Leu Gly Arq Asp 

195 200 205 

CTG CCC CTG ACC CCG CGC CGG GTG GTG CAG CCC ACC GAG CTC TAT GAG 672 
Leu Ala Leu Thr Pro Gly Arq Val Val Gin Arq Thr Glu Leu Tyr Glu 



(12) 1 0 - 1 0 8 6 8 2 

21 22 
210 215 220 

GTC ATT CAG TAC ACC CCG ACT ACC GAG ACG GTG GGC AAG ACA CCT GTG 720 
Leu lie Gin Tyr Ser Pro Thr Thr Glu Thr Val Gly Lys Thr Pro Val 
225 230 235 240 

CTG ATA GTG CCG CCC TTC ATC AAC AAG TAC TAC ATC ATG GAC ATG CGG 768 
Leu He Val Pro Pro Phe He Asn Lys Tyr Tyr He Met Asp Met Arq 

245 250 255 

CCC CAC AAC TCC CTG GTC GCC TGG CTG GTC GCC CAG GGC CAG ACG GTA 816 
Pro Gin Asn Ser Leu Val Ala Trp Leu Val Ala Gin Gly Gin Thr Val 

260 265 270 

TTC ATG ATC TCC TGG CGC AAC CCC GGC GTG GCC CAG GCC CAA ATC CAT 864 
Fhe Met lie Ser Trp Arq Asn Pro Gly Val Ala Gin Ala Gin lie Asp 

275 280 285 

GTC GAC GAC TAC GTG CTG GAT GGC CTC ATC GCC GCC CTG GAC GGC GTG 912 
Leu Asp Asp Tyr Val Val Asp Gly Val He Ala Ala Leu Asp Gly Val 

290 295 300 

GAG GCG GCC ACC GGC GAG CGG GAG GTG CAC GGC ATC GGC TAC TGC ATC 960 
Glu Ala Ala Thr Gly Glu Arq Glu Val His Gly lie Gly Tyr Cys He 
305 310 315 320 

GGC GGC ACC GCC CTG TCG CTC GCC ATG GGC TGG CTG GCG GCG CGG CGC 1008 
Gly Gly Thr Ala Leu Ser Leu Ala Met Gly Trp Leu Ala Ala Arq Arq 

325 330 335 

CAG AAC CAG CGG GTG CGC ACC GCC ACC CTG TTC ACT ACC CTG CTG GAC 1056 
Gin Lys Gin Arq Val Arq Thr Ala Thr Leu Fhe Thr Thr Leu Leu Asp 

340 345 350 

TTC TCC CAG CCC CGG GAG CTT GGC ATC TTC ATC CAC GAG CCC ATC ATA 1104 
Phe Ser Gin Pro Gly Glu Leu Gly He Phe He His Glu Pro He He 

355 360 365 

GCG GCC CTC GAG GCG CAA AAT GAG GCC AAG GGC ATC ATG GAC CGG CGC 1152 
Ala Ala Leu Glu Ala Gin Asn Glu Ala Lys Gly He Met Asp Gly Arq 

370 375 380 

CAG CTG GCG GTC TCC TTC ACC CTG CTG CGG GAG AAC AGC CTC TAC TGG 1200 
Gin Leu Ala Val Ser Phe Ser Leu Leu Arq Glu Asn Ser Leu Tyr Trp 
385 390 395 400 

AAC TAC TAC ATC GAC ACC TAC CTC AAG GGT CAG AGC CCG GTG GCC TTC 1248 
Asn Tyr Tyr He Asp Ser Tyr Leu Lys Gly Gin Ser Pro Val Ala Phe 

405 410 415 

GAT CTC CTG CAC TGG AAC AGC GAC AGC ACC AAT GTG GCG GGC AAC ACC 1296 
Asp Leu Leu His Trp Asn Ser Asp Ser Thr Asn Val Ala Gly Lys Thr 

420 42 5 430 

CAC AAC AGC CTG CTG CGC CCT CTC TAC CTG GAG AAC CAC CTG GTG AAG 1344 
His Asn Ser Leu Leu Arq Arq Leu Tyr Leu Glu Asn Gin Leu Val Lys 

435 440 445 

GCG GAC CTC AAG ATC CCC AAC ACC CGC ATC GAT CTC CGC AAC GTG AAG 1392 
Gly Glu Leu Lys He Arq Asn Thr Arq He Asp Leu Gly Lys Val Lys 

450 -455 460 

ACC CCT GTG CTG CTG GTG TCG GCG GTC GAC CAT CAC ATC GCC CTC TCG 1440 
Thr Pro Val Leu Leu Val Ser Ala Val Asp Asp His He Ala Leu Trp 
465 470 475 .480 

CAG GGC ACC TGG CAG CGC ATG AAG CTG TTT GCC GOG GAG CAG CCC TTC 1488 



(13) *SHW 1 0 - 1 0 8 6 8 2 

23 24 
Gin Cly Thr Trp Gin Gly Met Lys Leu Phe Gly Gly Glu Gin Arg Fhe 

485 ' 490 495 

CTC CTG GCG GAG TGC CGC CAC ATC GCC GGC ATC ATC AAC CCG CCG CCC 1536 
Leu Leu Ala Glu Ser Gly His He Ala Gly He He Asn Pro Pro Ala 

500 505 510 

GGC AAC MC TAC GGC TTC TCG CAC AAC GGG GCC CAG GCC GAG AGC CCG 1584 
Ala Asn Lys Tyr Gly Phe Trp His Asn Gly Ala Glu Ala Glu Ser Pro 

515 520 525 

GAG AGC TGC CTG GO, GGG GCG ACG CAC CAC GCC GGC TCC TGC TCG CCC 1632 
Glu Ser Trp Leu Ala Cly Ala Thr His Gin Gly Gly Ser Trp Trp Pro 

530 535 540 

GAG ATC ATG GGC TTT ATC CAC AAC CCT GAC GAA GOG TCA GAG CCC CTC 1680 
Glu Met Met Gly Phe He Gin Asn Arg Asp Glu Gly Ser Glu Pro Val 
545 550 555 560 

CCC GCG CGG GTC CCG GAG GAA GGG CTG GCC CCC CCC CCC GGC CAC TAT 1728 
Pro Ala Arg Val Pro Glu Glu Gly Leu Ala Pro Ala Pro Gly His Tyr 

565 570 575 

CTC AAC CTC CGG CTC AAC CCC CTG TTT GCC TGC CCA ACA GAG GAG GAC 1776 
Val Lys Val Arg Leu Asn Pro Val Phe Ala Cys Pro Thr Glu Glu Asp 

580 585 590 

GCC GCA TGA 1785 
Ala Ala 

C 0 0 9 9 ] : 2 * h : E8*K 

S2?'JcDS^ : 5 9 4 K?UO«3B : 

m?il<D%l : 7 Z sm * 
I5*J: 

Met Ser Gin Pro Ser Tyr Gly Pro Leu Phe Glu Ala Leu Ala His Tyr 

15 10 15 

Asn Asp Lys Leu Leu Ala Met Ala Lys Ala Gin Thr Glu Arg Thr Ala 

20 25 30 

Gin Ala Leu Leu Gin Thr Asn Leu Asp Asp Leu Gly Gin Val Leu Glu 

35 40 45 

Gin Gly Ser Gin Gin Pro Trp Gin Leu lie Gin Ala Gin Met Asn Trp 

50 55 60 

Trp Gin Asp Gin Leu Lys Leu Met Gin His Thr Leu Leu Lys Ser Ala 
65 70 75 80 

Gly Gin Pro Ser Glu Pro Val lie Thr Pro Glu Arg Ser Asp Arq Arg 

85 90 95 

Fhe Lys Ala Glu Ala Trp Ser Glu Gin Pro lie Tyr Asp Tyr Leu Lys 

100 105 no 

Gin Ser Tyr Leu Leu Thr Ala Arg His Leu Leu Ala Ser Val Asp Ala 

115 120 125 

Leu Glu Gly Val Pro Gin Lys Ser Arg Glu Arg Leu Arg Phe Phe Thr 

130 135 140 

Arg Gin Tyr Val Asn Ala Met Ala Pro Ser Asn Phe Leu Ala Thr Asn 
145 150 155 160 

Pro Glu Leu Leu Lys Leu Thr Leu Glu Ser Asp Gly Gin Asn' Leu Val 

165 170 175 

Arg Gly Leu Ala Leu Leu Ala Glu Asp Leu Clu Arg Ser Ala Asp Gin 
180 185 190 





(14) 



*SH}¥ 10- 1 0 8 6 3 2 



25 



26 



Leu Asn He Arg Leu Thr Asp Clu Ser Ala Phe Glu Leu Cly Arq Asp 

195 200 205 

Leu Ala Leu Thr Pro Cly Arq Val Val Gin Arq Thr Glu Leu Tyr Clu 

210 215 220 

Leu lie Gin Tyr Ser Pro Thr Thr Glu Thr Val Cly Lys Thr Pro Val 
225 . 230 235 240 

Leu lie Val Pro Pro Phe lie Asn Lys Tyr Tyr lie Met Asp Met Arq 

245 250 .255 

Pro Gin Asn Ser Leu Val Ala Trp Leu Val Ala Gin Cly Gin Thr Val 

260 265 270 

Phe Met He Ser Trp Arq Asn Pro Gly Val Ala Gin Ala Gin He Asp 

275 280 285 

Leu Asp Asp Tyr Val Val Asp Gly Val He Ala Ala Leu Asp Gly Val 

290 295 300 

Glu Ala Ala Thr Gly Glu Arq Glu Val His Gly lie Gly Tyr Cys He 
305 310 315 320 

Gly Gly Thr Ala Leu Ser Leu Ala Met Gly Trp Leu Ala Ala Arq Arq 

325 330 335 

Gin Lys Gin Arq Val Arq Thr Ala Thr Leu Phe Thr Thr Leu Leu Asp 

340 345 350 

Phe Ser Gin Pro Gly Glu Leu' Gly He Phe He His Glu Pro He He 

355 360 365 

Ala Ala Leu Glu Ala Gin Asn Glu Ala Lys Gly He Met Asp Gly Arq 

370 375 380 

Gin Leu Ala Val Ser Fhe Ser Leu Leu Arq Glu Asn Ser Leu Tyr Trp 
385 390 . 395 400 

Asn Tyr Tyr He Asp Ser Tyr Leu Lys Gly Gin Ser Pro Val Ala Fhe 

405 410 415 

Asp Leu Leu His Trp Asn Ser Asp Ser Thr Asn Val Ala Gly Lys Thr 

420 425 430 

His Asn Ser Leu Leu Arq Arq Leu Tyr Leu Glu Asn Gin Leu Val Lys 

435 440 445 

Gly Glu Leu Lys He Arq Asn Thr Arq He Asp Leu Gly Lys Val Lys 

450' 455 460 

Thr Pro Val Leu Leu Val Ser Ala Val Asp Asp His He Ala Leu Trp 
465 470 475 480 

Gin Gly Thr Trp Gin Gly Met Lys Leu Phe Gly Gly Glu Gin Arq Fhe 

485 490 495 

Leu Leu Ala Glu Ser Gly His He Ala Gly He He Asn Pro Pro Ala 

500 505 510 

Ala Asn Lys Tyr Gly Fhe Trp His Asn Gly Ala Glu Ala Glu Ser Pro 

515 520 525 

Glu Ser Trp Leu Ala Gly Ala Thr His Gin Cly Gly Ser Trp Trp Pro 

530 535 540 

Glu Met Met Gly Phe lie Gin Asn Arq Asp Glu Gly Ser Glu Pro Val 
545 550 555 560 

Pro Ala Arq Val Pro Glu Glu Gly Leu Ala Pro Ala Pro Gly . His Tyr 

565 570 575 

Val Lys Val Arq Leu Asn Pro Val Phe Ala Cys Pro Thr Glu Glu Asp 



580 



5S5 



590 



(15) 



27 
Ala Ala 
(0100] EMS* : 3 
$&1<D&2 : 3 54 



«pH¥ 1 0 - 1 0 8 6 3 2 
28 



r- : mm 

IS?'J<DH^ : qenonic DMA 



ATC ATC AAT ATG GAC CTG ATC AAC AGC TTT fiCC GAG CAG ATG CAA GGC 43 
Met Met Asn Met Asp Val lie Lys Ser Phe Thr Glu Gin Met Gin Gly 

15 10 15 

TTC GCC CCC CCC CTC ACC CCC TAC AAC CAG CTG CTG CCC ACC AAC ATC .96 



r n e 


A 1 n 

A i a 


A 1 ~ 

A I a 


Pro 


Leu 


Th r 


Arg 


Tyr Asn 


Gin 


.Leu 


Leu 


A 1 a 


Ser 


Asn 


I 1 e 










2 0 








2 5 










3 0 








GAA 


CAG 


CTG 


ACC 


C GG 


TTG 


CAG 


CTG GCC 


T C C 


GCC 


AAC 


GCC 


TAC 


GCC 


GAA 


14 4 


G 1 u 


G I n 


Leu 


Th r 


Arg 


Leu 


G 1 n 


Leu Ala 


Ser 


A 1 a 


Asn 


A 1 a 


Tyr 


A I a 


G 1 u 








3 5 










4 0 








4 5 










CTG 


GGC 


CTC 


AAC 


CAG 


TTG 


CAG 


GCC GT G 


AGC 


A A G 


GT G 


CAG 


GAC 


ACC 


CAG 


19 2 


Leu 


G I y 


Leu 


Asn 


G 1 n 


Leu 


G 1 n 


Ala Val 


O ~ _ 


Lys 


V a 1 


G 1 n 


Asp 


Th r 


G 1 n 






0 0 










5 5 








D 0 












AGC 


C i G 


/"» 

GOG 


GC C 


CTG 


GGC 


AC A 


GTG CAA 


L/ 1 u 




ALU 




AGC 


CAG 


CTC 


2 4 0 


o e r 


Leu 


A t 

A i a 


A 1 ~ 

A I a 


T 

Leu 


G 1 y 


Thr 


Val Gin 


Leu 


G 1 u 


Thr 


A I a 


Ser 


G 1 n 


Leu 




6 5 










7 0 








7 5 










8 0 




TCC 


CGC 


CAG 


ATG 


CTG 


GAT 


GAC 


ATC CAG 


A AG 


CTG 


AGC 


GCC 


CTC 


GGC 


CAG 


2 8 8 


Ser 


A r g 


G 1 n 


Me t 


Leu 


Asp 


Asp 


lie Gin 


Lys 


Leu 


Ser 


A I a 


Leu 


G 1 y 


G 1 n 












8 5 








9 0 










9 5 






CAG 


TTC 


A AG 


GAA 


GAG 


CTG 


GAT 


G T C CTG 


ACC 


GCA 


GAC 


GGC 


ATC 


A AG 


AAA 


3 3 6 


G 1 n 


Phe 


Lys 


G I u 


G. I u 


Leu 


Asp" 


Val. Leu 


Th r 


A I a 


Asp 


G 1 y 


I 1 e 


Lys 


Lys 










1 0 0 








1 0 5 










1 1 0 








AGC 


ACG 


GGC 


A A G 


GCC 


TGA 




















3 5 4 


Ser 


Th r 


G I y 


Lys 


A 1 a 









1 1 5 



(0101] ^J'J#^ 
ffiyiJO^S : 1 1 7 



4 



h*cz>>- : mm 



50 



(16) 



29 



4$HW1 0 - 1 0 8 6 8 2 
30 



Me t 
Ph e 
1 

1 0 
Ph e 
G 1 n 



Leu 
Ser 



Ser 
Leu 
6 5 

Ser 
Ly s 

9 0 
G 1 n 
Th r 



[0102] R?y»-9 ; 
BWDfiS : 4 0 5 



Met 
Th r 



A 1 a 
Leu 



G 1 u Gin 
Ser Ala 



G 1 y 
L y s 
5 0 

Leu 
G 1 u 

7 5 
A r g 
Leu 



Ph e 
A 1 a 



Ser Th r 



A s n 
G 1 u 



A 1 a 
Leu 



Leu 
A s n 
3 5 

Leu 
Va 1 

6 0 
A 1 a 
Th r 



G 1 n 
Ser 



L y s 
Asp 



G I y 
1 1 5 



Me t' 
G I n 



Pro 
A 1 a 
2 0 

Th r 
A I a 

45 
A s n 
G 1 n 



A 1 a 
A i a 



Me t 
A I a 



G I u 
Gly 
1 0 0 

L y s 



Asp Val lie Lys Ser 
Met Gin Gly 
5 

1 5 

Leu Th r Arg Tyr Asn 
Ser Asn lie 

2 5 

3 0 

Arg Leu Gin Leu Ala 
Tyr Ala G 1 u 

4 0 

Gin Leu Gin Ala Val 
Asp Thr 'Gin 
55 

Leu Gly Thr Val Gin 
Ser Gin Leu 



7 0 



8 0 



Leu Asp Asp lie Gin 
Leu Gly Gin 



8 5 



9 5 



Glu Leu Asp Val Leu 
lie Lys Lys 

1 0 5 

110 
A 1 a 

* E!?'J<£>3*I : qencmic CNA 



ATG 


AGC 


GCA 


CAA 


TCC 


CTG 


GAA 


GTA 


GGC 


CAG 


A AG 


GCC 


CGT 


CTC 


AGC 


A AG 




4 8 


Me t 


Ser 


A 1 a 


G I n 


Ser 


Leu 


G 1 u 


Va 1 


G 1 y 


G 1 n 

. 1 


Lys 


A 1 a 


Arg 


L e u 


Ser 


Lys 






1 0 








5 


1 5 








CGG 


TTC 


GGG 


GCG 


GCG 


GAG 


GTA 


GCC 


GCC 


TTC 


GCC 


GCG 


CTC 


TCG 


GAG 


G AC 




9 6 


Arg 


Phe 


G 1 y 


A I a 


A 1 a 


G 1 u 


Va 1 


A 1 a 


A 1 a 


Ph e 


A I a 


A 1 a 


Leu 


Ser 


G 1 u 


Asp 












2 0 










. 2 5 










3 0 










TTC 


A AC 


CCC 


CTG 


CAC 


CTG 


G AC 


CCG 


GCC 


TTC 


GCC 


GCC 


ACC 


ACG 


GCG 


TTC 




1 4 4 


Ph e 


Asn 


P r o 


Leu 


H i s 


Leu 


Asp 


P r o 


A 1 a 



31 

Ph e Ala 



GAG CGG 

CTC GCC 

Glu Arg 

Leu Ala 
5 0 

CTG CTG 

GGG AGC 

Leu Leu 

G 1 y S e r 
6 5 

7 5 

AGC CTC 

GTC GGG 

S e r Leu 
V a I * G 1 y 

9 0 

GAG GTG 

GAC A A G 

Glu Va 1 

Asp Lys 



ACC ACC 

GCC CTC 

Thr Thr 

Ala Leu 



GCC GTG 

Ala V a I 
1 3 0 

[0103] ejwm : 6 

E?U©*S : 1 3 4 

K?J : 

Met Ser 
Gin "Lys 
1 

1 0 

Arg" P h e 
Ph e Ala 



P h e A s n 



(17) «H¥- 1 0 - 1 0 8 6 8 2 

32 



A 1 p 


1 U 1 


Thr 


A 1 a 
n I d 


P h p 






O <J 
















A C 












c c c 

\^ o 


ATA 

A 1 j". 


u 1 U 


C A C 
O A v_/ 


n n. p 

Vjr Kjr \^ 


A T Cr 


C T G 


A CC 


O 1 




T C C 


(ZflCr 




1 Q ? 


r r 0 


I 1 £s 

lie 


v a l 


n i s 


b i y 


ivi e l 


Leu 


o e r 


Leu 


n v, n 

f n e 


b e r 


if i y 














D 0 






D U 














r* r* r* 




L Ab 


t t r* 


H 


c c c 


A A P 

/\ M. \J 


A I v., 


'PAT' 
1 A 1 


t 


T 


/** A A 




£i **■ U 


u- I y 


ur i n 


Cj 1 n 


Leu 


Pro 


u j y 


Lys 


1 1 e 


1 y r 


Leu 


Lr 1 y 


(j l n 












7 0 
















Q A 
O U 






AGC 


1 i G 


A AG 


C 1 G 


f 




T T T 
ill 


GAC 


G A G 


GTG 


ACG 






n o o 
£ o o 


o e r 


r n e 


I ire. 

Lys 


T All 

Leu 


Pro 


v a i 


P h P 

r n e 


Asp 


Glu 


v a 1 


1 n r 


A 1 « 

Ala 










0 CL 
o O 
















y o 








Cr A U 




AOL 


f* 


*T T* 
Oil 


c c c 


PAH 
VJ >\ 


o /** o 

out 


A I O 




Aul 


t* r* 




^ ^ R 
o O O 


G I u 


V a 1 


1 n r 


A 1 a 


Leu 


Arg 


U 1 u 


Fro 


l 1 e 


A 1 a 


I n r 


Leu 








10 0 










l V o 






110 










CGC 


AT C 


TTC 


ACC 


CAA 


GGC 


GGC 


GCC 


GTG 


ACG 


GGG 


G A A 




3 8 4 


Arg 


I 1 e 


P h e 


Thr 


Gin 


u i y 


Lx i y 


A 1 «-k 


v a 1 


Th r 


G 1 y 


G 1 u 






1 1 5 










1 2 0 






1 2 5 












C T C 
lr 1 U 


A A vjr 


CTG 


CCT 


T A A 


















two 


v a i 


Lys 


Leu 


Pro 












* 
























*40 










A 1 a 


Lz i n 


Ser 


L e u 


G 1 u 


Va 1 


P 1 V 

i y 


A I a 


Arg: 


Leu 


Ser 


Lys 










5 
















1 5 








G 1 y 


A 1 a 


A 1 a 


G 1 u 


Va 1 


A 1 a 


A 1 a 


A I a 


Leu 


Ser 


G 1 u 


Asp 








2 0 










2 5 






3 0 










Pro 


Leu 


H i s 


Leu 


Asp 


Pro 


A 1 a 



# 



(18) 



33 



ftmW-l 0- 1 0 8 6 8 2 
34 



Ph 


e 


A 1 


a 


A I 


a 


Th 


r 


T h r 


A 1 a 


Phe 














3 


5 












4 0 
















4 


5 












G 1 




A r 


g 


P r 


o 


I I 


Q 


V a 1 


H i s 


G 1 v 


Met 


Leu 


L e 


u 


A I 


a 


S e 


j* 


L e 




P h e 


S e r 


G 1 v 










5 


o 




























6 


o 
















L e 




L e 




G I 


v 


G I 


n 


c i n 

VJ 1 11 


T on 


Pro 


Cr 1 v 

vjt i y 


T v" q 


G 1 


v 


S e 


r 


T 1 

i i 


p 
c 


T v 


r 
1 


Leu 


Cr 1 V 

v i y 


1 n 
Vjr i XI 






6 


5 
















7 O 












7 


5 




















S p 


i 


f p 

l—i c 


1 1 

u 


o c 


r 


P h 


S 


i_i y s 


1_ S U 


P t n 
i 1 u 


V a 1 
v a i 


ill c 


V a 

V d 


1 
i 


rL 1 

vjr i 


"1/ 

y 


A <5 


P 




U 


V a 1 

V cL I 


T h r 


A 1 a 
H 1 a 






















O O 










q 


n 
























Lx 1 


u 


v a 


i 


r \ 

\jr I 


u 


v a 




I n r 


A I a 


Leu 


Ar g 


C 1 it 
Vj I U 


A s 


P 


Ly 


s 


P r 


0 


I I 


e 


A 1 a 


Th r 


Leu 


















1 0 


0 










1 0 5 


















1 1 0 










Th 


r 


Th 


r 


A r 


gr 


i I 


e 


Ph e 


Th r 


G 1 n 


G 1 y 


G 1 y 


A 1 


a 


L e 


u 


A I 


a 


Va 


1 


Th r 


G I y 


G 1 u 














1 1 


5 












1 2 0 
















12 


5 












A 1 


a 


Va 


I 


Va 


1 


Ly 


s 


Leu 


Pro 









1 3 0 

C 0 1 0 4 ] ie?iJ#^ : 7 
ie?'i<D^$ : 2 7 

kfjos : met 

m<\ : 

CCSCCSTGGA TCAAYAAGTW YTAYATC 
[0105] RJftS-? : 8 
IEW<D«S : 2 7 

EW: 

SACCCASGCS GTCCARTCSC GCCACCA 
C 0 1 0 6 ] W*im^ : 9 
ISWDfiS : 3 187 

ie?*JCDa^ : qencmic DNA 



27 



27 



M^at^ : CDS 
ISfiffiB : 384.. 734 
: CDS 

40 ??£{£g : 830.. 2611 



AGATCTGGAC 
CCACCAXCC 
CAGCCCCGCG 
CGCCTCCCTC 
TTTAC4CCAA 
ACACGTAAAC 
CTGTTCTCCC 



CGGGCTGCTG GCCTCGGCCA CGCCCGCCAC CGCCACCCCG CAGCAACCGA 
CAGAGGTTTC ATCCGCATTC CTTGGCACTC TG4ATGACCT GCCACCCTAT 
CCGCTGCGGC GACGGCGCGC CGGACCCAGT GCCTCACCTC TCCTCTGATC 
GACGGGCGTC GCTCACAAAA AAATTCAMC AGAAATTAAC ATTTA7CTCA 
ACCGCATTTG GTTGCAGAAT GCTCAAACCT GTGTTTGAAC AGAGCMCCA 
ACGCATGACA TGCAGTACCC GTAAGAAGGG CCGATTGGCC CACAACAACA 
GAACTGGAGA CCG ATC ATG AAT ATC GAC GTC ATC AAC ACC 
Met Met Asn Met Asp Val He Lys Ser 



60 
120 
180 
240 
300 
360 
410 



(19) 1 0 - 1 0 8 6 8 2 

35 36 
1 5 

TTT ACC GAG CAG ATG CAA GGC TTC GCC GCC CCC CTC ACC CGC TAC AAC 458 
Fhe Thr Glu Gin Met Gin Cly Phe Ala Ala Pro Leu Thr Arg Tyr Asn 
10 15 20 25 

CAG CTG CFG GCC AGC AAC ATC GAA CAG CTC ACC CGG TTG CAG CTG GCC 506 
Gin Leu Leu Ala Ser Asn lie Clu Gin Leu Thr Arg Leu Gin Leu Ala 

30 35 40 

TCC GCC AAC GCC TAC GCC GAA CTG GGC CTC AAC CAG TTC CAG GCC GTG 554 
Ser Ala Asn Ala Tyr Ala Glu Leu Cly Leu Asn Gin Leu Gin Ala Val 

45 50 55 

AGC AAC GTG CAG GAC ACC CAG AGC CTG GCG GCC CTC GGC ACA GTG CAA 602 
Ser Lys Val Gin Asp Thr Gin Ser Leu Ala Ala Leu Gly Thr Val Gin 

60 65 70 

CTG GAG ACC GCC ACC CAG CTC TCC CGC CAG ATG CTG GAT GAC ATC CAG 650 
Leu Glu Thr Ala Ser Gin Leu Ser Arg Gin Met Leu Asp Asp lie Gin 

75 80 85 

AAC CTG AGC GCC CTC GGC CAG CAG TTC AAC GAA GAG CTG GAT CTC CTG 698 
Lys Leu Ser Ala Leu Gly Gin Gin Phe Lys Glu Glu Leu Asp Val Leu 
90 95 100 105 

ACC GCA GAC GGC ATC AAG AAA AGC ACG GGC AAC GCC TGATAACCCC 744 
Thr Ala Asp Gly He Lys Lys Ser Thr Gly Lys Ala 

HO 115 
TGGCTGCCCG TTCGCGCAGC CACATCTCCC CATGACTCGA CGCTACGGGC TAGTTCCCGC 804 
CTCGGCTCTC GGTGAAGGAG AGCAC ATG AGC CAA CCA TCT TAT GGC CCC CTG 856 

Met Ser Gin Pro Ser Tyr Gly Pro Leu 
1 5 

TTC GAG GCC CTG GCC CAC TAC AAT GAC AAG CTG CTG GCC ATG GCC AAG 904 
Phe Glu Ala Leu Ala His Tyr Asn Asp Lys Leu Leu Ala Met Ala Lys 
10 15 20 25 . 

GCC CAG ACA GAG CGC ACC GCC CAG GCG CTG CTG CAG ACC AAT CTG GAC 952 
Ala Gin Thr Glu Arq Thr Ala Gin Ala Leu Leu Gin Thr Asn Leu Asp 

30 35 40 

GAT CTG GGC CAG GTG CTG GAG CAC GGC AGC CAG CAA CCC TGG CAG CTG 1000 
Asp Leu Gly Gin Val Leu Glu Gin Gly Ser Gin Gin Pro Trp Gin Leu 

45 50 55 

ATC CAG GCC CAG ATG AAC TGG TGG CAG GAT CAG CTC AAG CTG ATG CAG 1048 
lie Gin Ala Gin Met Asn Trp Trp Gin Asp Gin Leu Lys Leu Met an 

60 65 70 

CAC ACC CTG CTC AAA AGC GCA GGC CAC CCC AGC GAG CCG CTG ATC ACC ' 1096 
His Thr Leu Leu Lys Ser Ala Gly Gin Pro Ser Glu Pro Val He Thr 

75 80 85 

CCG GAG CGC AGC GAT CGC CGC TTC AAC GCC GAG GCC TGG AGC GAA CAA 1144 
Pro Glu Arg Ser Asp Arg Arg Phe Lys Ala Glu Ala Trp Ser Glu Gin 
90 95 100 105 

CCC ATC TAT GAC TAC CTC AAG CAG TCC TAC CTG CTC ACC GCC AGG CAC -1192 * 
Pro He Tyr Asp Tyr Leu Lys Gin Ser Tyr Leu Leu Thr Ala Arq His 

110 115 120 

CTG CTG GCC TCG GTG GAT CCC CTG GAG CGC CTC CCC CAG AAG AGC CCG 1240 
Leu Leu Ala Ser Val Asp Ala Leu Glu Gly Val Pro Gin Lys Ser Arq 
125 130 135 



(20) 

37 

CAC CGC CTG CCT TTC TTC ACC CCC CAC TAC CTC AAC 
Glu Arq Leu Arq Phe Fhe Thr Arq Gin Tyr Val Asn 

140 145 
AGC AAC TTC CTG CCC ACC AAC CCC GAG CTG CTC AAC 
Ser Asn Phe Leu Ala Thr Asn Pro Clu Leu Leu Lys 
155 160 . 165 

TCC CAC GGC CAG AAC CTG GTG CGC GGA CTG GCC CTC 
Ser Asp Gly Gin Asn Leu Val Arg Gly Leu Ala Leu 
170 . 175 180 

CTG CAC CGC AGC GCC CAT CAC CTC AAC ATC CGC CTG 
Leu Glu Arq Ser Ala Asp Gin Leu Asn He Arq Leu 

190 135 
GCC TTC GAG CTC GGG CGG GAT CTG GCC CTG ACC CCC 
Ala Phe Glu Leu Gly Arq Asp Leu Ala Leu Thr Pro 

205 210 
CAG CGC ACC GAG CTC TAT CAC CTC ATT CAG TAC AGC 
Gin Arq Thr Glu Leu Tyr Glu Leu He Gin Tyr Ser 

220 225 
ACC GTG GGC AAG ACA CCT GTG CTG ATA GTG CCG CCC 
Thr Val Gly Lys Thr Pro Val Leu He Val Pro Pro 
235 240 245 

TAC TAC ATC ATG GAC ATG CGG CCC CAG AAC TCC CTG 
Tyr Tyr He Met Asp Met Arq Pro Gin Asn Ser Leu 
250 255 260 

CTC GCC CAG GGC CAG ACC GTA TTC ATG ATC TCC TGG 
Val Ala Gin Gly Gin Thr Val Phe Met He Ser Trp 

270 275 
GTG GCC CAG GCC CAA ATC GAT CTC GAC GAC TAC GTG 
Val Ala Gin Ala Gin He Asp Leu Asp Asp Tyr Val 

285 290 
ATC GCC GCC CTG GAC GGC CTC GAG GCG GCC ACC GGC 
He Ala Ala Leu Asp Gly Val Glu Ala Ala Thr Gly 

300 305 
CAC CGC ATC GGC TAC TCC ATC GGC GGC ACC GCC CTG 
His Gly He Gly Tyr Cys He Gly Gly Thr Ala Leu 
315 320. 325 

GGC TGG CTG GCG GCG CGG CGC CAG AAG CAG CGG GTG 
Gly Trp Leu Ala Ala Arq Arq Gin Lys Gin Arq Val 
330 335 340 

CTG TTC ACT ACC CTG CTG GAC TTC TCC CAG CCC GGG 
Leu Phe Thr Thr Leu Leu Asp Phe Ser Gin Pro Gly 

350 355 
TTC ATC CAC GAG CCC ATC ATA CCG CCG CTC CAG GCG 
Fhe He His Glu Pro lie He Ala Ala Leu Glu Ala 

365 370 
AAG GGC ATC ATG CAC CCC CGC CAG CTG. GCC GTC TCC 
Lys Gly He Met Asp Gly Arq Gin Leu Ala Val Ser 

380 3S5 
CCG GAG AAC AGC CTC TAC TGG AAC TAC TAC ATC GAC 
Arq Glu Asn Ser Leu Tyr Trp Asn Tyr Tyr He Asp 



»H¥1 0 
38 

CCC ATG GCC CCC 12 8S 

Ala Met Ala Pro 

150 

CTG ACC CTG GAG . 1336 
Leu Thr Leu Glu 



10 8 6 8 2 



TTG GCC 
Leu Ala 

ACC GAC 
Thr Asp 

GGC CGG 
Gly Arq 
215 
CCG ACT 
Pro Thr 
230 

TTC ATC 
Phe He 

GTC GCC 
Val Ala 

CGC AAC 
Arq Asn 

CTG GAT 
Val Asp 
.295 
GAG CGG 
Glu Arq 
310 

TCC CTC 
Ser Leu 

CGC ACC 
Arq Thr 

GAG CTT 
Glu Leu 

CAA AAT 
Gin Asn 
375 
TTC AGC 
Phe Ser 
390 

ACC TAC 
Ser Tyr 



GAG GAT 
Glu Asp 
185 
GAA TCC 
Glu Ser 
200 

GTG GTG 
Val Val 

ACC GAG 
Thr Glu 

AAC AAG 
Asn Lys 

TGG CTG 
Trp Leu 
265 
CCG GGC 
Pro Gly 
280 

GGC GTC 
Gly Val 

GAG GTG 
Glu Val 

GCC ATG 
Ala Met 

GCC ACC 
Ala Thr 
345 
GGC ATC 
Gly He 
360 

GAG GCC 
Glu Ala 

CTG CTC 
Leu Leu 

CTC MG 
Leu Lys 



1384 



1432 



1480 



1528 



1576 



1624 



1672 



1720 



1768 



1S16 



1864 



1912 



1960 



2008 



2056 



(21) ftffiW- 1 0 - 1 0 8 6 8 2 

39 40 
395 400 .405 

GCT CAC AGC CCG GTG CCC TTC GAT CTG CTG CAC TGG AAC AGC GAC ACC 2104 
Gly Gin Ser Pro Val Ala Phe Asp Leu Leu His Trp Asn Ser Asp Ser 
410 415 420 425 ■ 

ACC AAT GTG GCG GGC AAG ACC CAC AAC AGC CTG CTG CGC CGT CTC TAC 2152 
Thr Asn Val Ala Gly Lys Thr His Asn Ser Leu Leu Arq Arq Leu Tyr 

430 435 440 

CTG CAC AAC CAG CTG> GTG AAG CCG GAG CTC AAG ATC CGC AAC ACC CGC 2200 
Leu Glu Asn Gin Leu Val Lys Gly Glu Leu Lys He Arq Asn Thr Arq 

445 450 455 

ATC GAT CTC GGC AAG GTG AAG ACC CCT GTG CTG CTG CTG TCG GCG GTG 2248 
He Asp Leu Gly Lys Val Lys Thr Pro Val Leu Leu Val Ser Ala Val 

460 465 470 

GAC GAT CAC ATC CCC CTC TGG CAG CGC ACC TGG CAC CCC ATG AAC CTC 2296 
Asp Asp His lie Ala Leu Trp Gin Gly Thr Trp Gin Gly Met Lys Leu 

475 480 485 

TTT GGC GGG GAG CAG CGC TTC CTC CTG GCG CAG TCC GGC CAC ATC GCC 2344 
Fne Gly Gly Glu Gin Arq Phe Leu Leu Ala Glu Ser Gly His He Ala 
490 495 500 505 

GGC ATC ATC AAC CCG CCG GCC GCC AAC AAG TAC GGC TTC TGG CAC AAC 2392 
Gly lie He Asn Pro Pro Ala Ala Asn Lys . Tyr Gly Phe Trp His Asn 

510 515 520 

GGG GCC GAG GCC GAG ACC CCG GAG AGC TGG CTG GCA GGG GCG ACG CAC 2440 
Gly Ala Glu Ala Glu Ser Pro Glu Ser Trp Leu Ala Gly Ala Thr His 

525 530 535 

CAC GGC GGC TCC TGG TGG CCC GAG ATG ATG GCC TTT ATC CAG AAC CGT 2488 
Gin Gly Gly Ser Trp Trp Pro Glu Met Met Gly Phe He Gin Asn Arq 

540 545 550 

GAC GAA GGG TCA GAG CCC GTC CCC GCC CCG CTC CCC CAC CAA CCC CTC 2536 
Asp Glu Gly Ser Glu Pro Val Pro Ala Arq Val Pro Glu Glu Gly Leu 

555 560 565 

GCC CCC GCC CCC GGC CAC TAT GTC AAG GTG GCG CTC AAC CCC CTG TTT 2584 
Ala Pro Ala Pro Gly His Tyr Val Lys Val Arq Leu Asn Pro Val Fhe 
570 575 580 585 

GCC TGC CCA ACA GAG CAC CAC GCC GCA TGAGCGCACA ATCCCTGGAA 2631 
Ala Cys Pro Thr Glu Glu Asp Ala Ala 
590 

GTAGGCCAGA AGGCCCGTCT CAGCAACCGC TTCCCCGCGG CGCAGGTACC CGCCTTCGCC 2691 
GCCCTCrCGG ACGACTTCAA CCCCCTCCAC CTGCACCCCC COTCCCCCC CACCACCCCG 2751 
TTCGAGCGGC CCATACTCCA CGGCATGCTG CTCGCCAGCC TCTTCTCCGG GCTGCTGGGC 2811 
CAGCAGTTGC CCGGCAAGCG GACCATCTAT CTGGGTCAAA GCCTCAGCTT CAAGCTGCCG 2871 
GTCTTTGTCG GGGACGAGGT GACGGCCGAG GTGGAGGTGA CCGCCCTTCG CGAGGACAAG 2931 
CCCATCGCCA CCCTGACCAC CCGCATCTTC ACCCAAGGCG GCGCCCTCGC CGTGACGGGG 2991 
GAAGCCCTGG TG4AGCTCCC TTAAGCACCG CCGGCACGCA CGCACAATCA GCCCGGCCCC 3051 
TGCCGCGCTG ATTGTTCTCC CCCGCTCCGC TTGCCCCGT TTTCGGGGCA ATTTGGCCCA 3111 
GCCCCTTTCC CTGCCCCGCC TAACTGCCTA AAATGGCCGC CCTGCCCTGT AGGCATTCAT 3171 
CCAGCTAGAG GAATTC 3187 
(0107] EJW* : 1 0 • m<0%L : 
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agatctcg\c cccccrccrc GCCTGGGCCA CCCCCGCCAG CCCCACCCCC GAGCAACCGA 60 
GCAGCAGGGC CAGAGGTTTC ATCCGGATTC CTTGGCAGTC TGAATGACGT GCCAGCCTAT 120 
CACCCCGCCG CCCCTCCCGC GACCGCCCCC CCXACCCACT GCGTCACCTC TCCTCTCATC 180 
CCCCTCCCTC CACCCGCGTC GCTGACAAAA AAATTCAAAC AGVAATTAAC ATTTATCTCA 240 
TTTAOCCAA ACCGCATTTG GTTGCAGAAT GCTCAAAGCT GTGTTTGVAC ACACCAAGCA 300 
ACACCTAAAC AGCGATCACA TCCACTACCC GTAAGAAGGG CCGATTCGCC CACAACAACA 360 
CTGTTCTGCC GAACTGGAGA CCGATGATCA ATATGGACCT GATCAAG\GC TTTACCGAGC 420 
AGATGCAAGG CTTCGCCGCC CCCCTCACCC CCTACAACCA CCTCCTOCCC AGCMCATCC 480 
AACAGCTGAC CCGCTTGCAG CTGGCCTCCG CCAACGCCTA CGCCGAACTG CCCCTCAACC 540 
ACTTGCXGGC CGTGAGCAAG GTCCAGGACA CCCACAGCCT GGCCCCCCTG GGCACAGTGC 600 
MCTGCAGAC CGCCAGCCAG CTCTCCCGCC AGATGCTOCA TGACATCCAG MGCKAGCG 660 
CCCTCGGCCA GCAGTTCAAG GAACAGCTGG ATGTCCTCAC CGCAGACGGC ATCAAGAAAA 720 
GCACGGGCAA GGCCTGATAA CCCCTGGCTG CCCCTTCGCG CAGCCACATC TCCCCATGAC 780 
TCGACGCTAC GGGCTAGTTC CCGCCTCGGG TGTGGGTGAA GGAGAGCACA TGAGCCAACC 840 
ATCTTATGGC CCGCTGTTCG AGGCCCTGGC CCACTACAAT GACAAGCTGC TGGCCATGGC 900 
CAAGGCCCAG ACAGACCGCA CCGCCCACGC GCTCCTGCAG ACCAATCTGG ACGATCTCGG 960 
CCAGGTGCTG GAGCAXGCA GCCAGCAACC CTGGCAGCTG ATGCAGGCCC AGATGAACTG 1020 
GTGGCAXAT CAGCTCAAGC TGATGCAGCA CACCCTGCTC AAAAGCOCAG GCCAGCCGAG 1080 
CGAGCCGCTG ATCACCCCGG AGCGCAGCCA TCGCCGCTTC AAGGCCGAGC CCTGGAGCGA 1140 
ACAACCCATC TATGACTACC TCAAGCAGTC CTACCTGCrC ACCGCCAGGC ACCTCCrGGC 1200 
CTCGGTGGAT GCCCTGGAGG GCCTCCCCCA GAAGAGCCCG GAGCGGCTCC GTTTCTTCAC 1260 

. CCGCC/OAC CTCAACGCCA TGGCCCCCAG CAACTTCCTG GCCACCAACC CCGAGCTGCT 1320 
CAACCTGACG CTGG^CTCCG ACGGCCAGAA CCTGCTGCGC GCACTGGCCC TCTTGGCCGA 1380 
GGATCTGGAG CGCAGCGCCC ATCACCTCAA CATCCGCCTG ACCGACGAAT CCGCCTTCCA 1440 
GCTCGGGCGG GVTCTGGCCC TGACCCCGGG CCOGCTGCTG CACCGCACCG AGCTCTATGA 1500 
GCTCATTCAC TACAGCCCGA CTACCGAGAC GGTGGGCAAG ACACCTCTCC TGATACTGCC 1560 
GCCCTTCATC AACA/VGTACT ACATCATGCA CATGCGGCCC CAGAACTCCC TGGTCGCCTG 1620 
GCTGGTCGCC CAGGGCCAGA CGCTATTCAT GATCTCCTGG CGCAACCCGG GCGTGCCCCA 1680 

■ GGCCCAAATC GATCTCGACG ACTACGTGGT GGATGGCGTC ATCGCCGCCC TGGACGGCGT 1740 
GGAGGCGGCC ACCGGCGAGC OZCACCTCCA CCGCATCGGC TACTGCATCG GCGGCACCGC 1800 
CCTCTCGCrC GCCATGGGCr GCCTCGCGOG CCGCCGGCAG AAGCACCGCC TGCCCACCGC 1860 
CACCCTGTTG ACTACCCTCC TGO a CTTCTC CCAGCCCGGG GAGCTTGGCA TCTTCATCCA 1920 
CGAGCCCATC ATAGCGGCGC TCGAGGCGCA AAATGAGGCC AAGGGCATCA TGGAGGGGCG 1980 
CCAGCTGGCG CTCTCCTTCA GCCTGCTGCG GGAGAACAGC CTCTACTGGA ACTACTACAT 2040 
CGACAGCTAC CTCAACXCTC AGAGCCCGCT GGCCTTCCAT CTGCTGCACT GGAACAGCGA 2100 
CACCACCAAT GTGGCGGGCA AGACCCACM CAGCCTGCTG CGCCGTCTCT ACCTGCAGAA 2160 
CCAGCTGCTG AAGGGGGAGC TCAAGATCCC CAACACCCGC ATCGATCTCG GCAAGCTGAA 2220 
CACCCCTGTG CTGCTCCTCT CGGCGGTGCA CGATCACATC GCCCTCTGGC AGGGCACCTG 2280 
GCAGGGCATG AAGCTCTTTG GCGGGGAGCA GCGCTTCCTC CTGGCGGACT CCGGCCACAT 2340 
CCCCGGGATC ATCVCCCGC GCGCCGCCM CAAGTACGGC TTGTGGCACA ^CGGGGCCGA 2400 
GGCCGAGAGC CCGGAGAGCT GGCTGGCAGG GGCGACGCAC CAGGGCGGCT CCTGCrCGCC 2460 
GGAGATGATG GGCTTTATCC ACAACCCTCA CGAAGGGTCA GAGCCCCTCC CCGCGCGGGT 2520 
CCGGG/3GCAA CGGCTGGCCC CCGCCGCCGG CCACTATCTC AAGGTGCGGC TCAACCCCCT 2580 
GTTTGCGTGC CCAACAGAGG AGCACGCCGC ATG AGC GCA CAA TCC CTG GAA GTA 2634 



GGC CAG AAG GCC CGT CTC AGC AAG CGG TTC GGG CCG GCG GAG GTA CCC 2682 



Met Ser Ala Gin Ser Leu Glu Val 
1 5 



(23) *»8¥1 0- 1 0 8 6 8 2 

43 44 
Gly Gin Lys Ala Arq Leu Ser Lys Arq Phe Cly Ala Ala Glu Val Ala 

10 15 ' 20 

GCC TTC GCC GCG CTC TCG GAG GAC TTC AAC CCC CTG CAC CTG GAC CCG 2730 
Ala Phe Ala Ala Leu Ser Glu Asp Phe Asn Pro Leu His Leu Asp Pro 
25 30 35 40 

CCC TTC GCC GCC ACC AGG GCG TTC GAG CCG CCC ATA GTC CAC CCC ATC 2778 
Ala Phe Ala Ala Thr Thr Ala Phe Glu Arq Pro He Val His Cly Met 

......... 45 50 55 

CTG CTC GCC AGC CTC TTC TCC GGC CTG CTG GGC CAC CAC TTC CCG GCC 2826 
Leu Leu Ala Ser Leu Phe Ser Gly Leu Leu Gly Gin Gin Leu Pro Gly 

60 65 70 

AAG GGG AGC ATC TAT CTG GGT CAA AGC CTC AGC TTC AAG CTG CCG CTC 2874 
Lys Gly Ser lie Tyr Leu Gly Gin Ser Leu Ser Phe Lys Leu Pro Val 

75 80 85 

TTT GTC GGG GAC GAG CTG ACG GCC GAG GTG CAC CTG ACC GCC CTT CGC 2922 
Phe Val Gly Asp Glu Val Thr Ala Glu Val Glu Val Thr Ala Leu Arq 

90 95 100 

GfiC GAC AAG CCC ATC GCC ACC CTG ACC ACC CGC ATC TTC ACC CAA GGC 2970 
Glu Asp Lys Pro He Ala Thr Leu Thr Thr Arq He Phe Thr Gin Cly 
105 110 115 120 

GGC GCC CTC GCC GTC ACG GGG GAA GCC GTG GTC AAG CTG CCT .3012 
Gly Ala Leu Ala Val Thr Gly Glu Ala Val Val Lys Leu Pro 

125 130 
TAAGCACCGG CGGCACGCAG GCACAATCAG CCCCGCCCCT GCCGGGCTGA TTCTTCTCCC 3072 
CCGCTCCGCT TGCCCCCTTr TTCGGGGCAA TTTGGCCCAG GCCCTTTCCC TGCCCCGCCT 3132 
AACTGCCTAA AATGGCCCCC CTGCCGTGTA GGCATTCATC CAGCTAGAGG AATTC 3187 
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GCCATATGAG CGCACAATCC CTCCAAGTAG 
(0111] SE?"J#^ : 1 4 
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CTGGGATCCG CCGGTGCTTA AGGCAGCTTG 
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45 46 
Ser Ala Gin Ser Leu Glu Val Gly Gin Lys Ala Arq Leu Ser Lys Arq 

1 5 ' 10 15 

Fhe Gly Ala Ala 
20 

[0113] ge?iJ#-^ : 1 6 * h : mm 

ISWDfiS : 2 1 EFJOaS : F 

I2?J : 

Met Ser Ala Gin Ser Leu Glu Val Gly Gin Lys Ala Arq Leu Ser Lys 

1 5 10 15 

Arq Phe Gly Ala Ala 
20 
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